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ABSTRACT 



The systematic errors in the virial mass-to-light ratio, M^/L, of galaxy clusters as 
an estimator of the field M/L value are assessed. We overlay 14 clusters in redshift 
space to create an ensemble cluster which averages over substructure and asymmetries. 
The combined sample, including background, contains about 1150 galaxies, extending 
to a projected radius of about twice r2oo- The radius r200) defined as where the 
mean interior density is 200 times the critical density, is expected to contain the 
bulk of the virialized cluster mass. The dynamically derived M(r2oo)/-^(^20o) of the 
ensemble is (0.82 it 0.14)(Aft,/L). The M^/L overestimate is attributed to not taking 
into account the surface pressure term in the virial equation. Under the assumption 
that the velocity anisotropy parameter is in the range < /3 < ^/a, the galaxy 
distribution accurately traces the mass profile beyond about the central 0.3r2oo- There 
are no color or luminosity gradients in the galaxy population beyond 2r200) but there 
is 0.11 lb 0.05 mag fading in the r band luminosities between the field and cluster 
galaxies. We correct the cluster virial mass-to-light ratio, M^/L = 289 it SO/iMq/Lq 
(calculated assuming = 0.1), for the biases in and mean luminosity to estimate 
the field M/L = 213 it 59/iMq/Lq. With our self-consistently derived field luminosity 
density, j / pc = 1136 it 138/iM0/L0 (at z ~ Ya), the corrected M/L indicates 
r^o = 0.19 lb 0.06 lb 0.04 (formal la random error and estimated potential systematic 
errors) for those components of the mass field in rich clusters. 
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1. Introduction 



The existence of dark matter was discovered in galaxy clusters where the velocity dispersions 
are nearly an order of magnitude higher than expected from the gravitational binding provided by 
the stellar masses of their visible galaxies ( Zwicky 1933 , Smith 1936 , [Zwicky 1937 , ^chwarzschild 



1954 ). Clusters are gravitationally bound, quasi-equilibrium systems assembled over a Hubble time 



via the infall of the mass in the surrounding field ( |van Albada 196d| , |Abell 1965| , [Peebles 197d| , [Gott| 



fc Gunn 197^ , [White 19761 [Fillmore fc Goldreich 1984 [Bertschinger 1985|) , which implies that 
both the dark matter and the galaxies within clusters have their origins in the field. Because 
clusters are large systems that draw their mass and galaxy content from regions 20h~^ Mpc across, 
measurements of cluster AI/L values should be representative of the field value, although not 
necessarily identical to it because of differential galaxy evolution. The product of the field M/L 
with the field luminosity density, j, is equal to the mean mass density of the universe, po ( [Oort| 
1958). The cosmological density parameter, ^Iq = po/pc, is therefore estimated as the ratio of the 



cluster M/L (corrected to the field) to the {M/L)c = Pc/j for closure {e.g. |Gunn 1978[) . The 
resulting Q. estimate has no dependence on Hq for dynamically measured cluster masses ( pott fc| 



Turner 1976 ). The purpose of this paper is mainly to correct our cluster virial mass-to- light ratio, 
M^/L, (Carlberg, et al. 1996, hereafter |C9|) to the field M/L. 



Cluster dynamical masses are usually calculated from the virial mass estimator, M„, which 
has the drawback that it makes explicit assumptions about the dynamical state of the clusters. Its 
reliability is critically dependent on the galaxy population being in dynamical equilibrium with the 
cluster potential and the galaxy distribution having the same spatial distribution as the total mass 
distribution of the cluster. Measured virial mass-to- light ratios of clusters, M^/L, are generally in 
the range 200 — OOO/iMq/L© {e.g. Punn 197g , Ramella, Geller fc Huchra 1989| ) which, in ratio to 
{M/L)c ~ 1500/1 ( [Efstathiou, Elhs fc Peterson 1988| , [Loveday et al. 1993| ) indicates 17 ~ 0.1 - 0.4. 
Cluster r^o values have not generally been accepted as conclusive because there are a number of 
uncontrolled sources of possible systematic error. For instance, cluster galaxies with blue colors 



are known to have a higher velocity dispersion and are more extended than the red galaxies ( Rood 



et al. 1972 , Kent Sz Gunn 1982 ) leading to virial mass estimates that are about a factor of four 



different for our dataset. The substantial color difference between cluster and field galaxies opens 
up the possibility that their luminosities per unit mass are quite different. The assumptions which 
go into the cluster 0, calculation need to be tested for their validity for a specific sample. 

The possibility that the mass distributions of galaxy clusters are more extended than their 
constituent galaxy populations has been recognized for many years (Limber 1959). The detection 
problem is that dynamical mass estimators do not have any useful sensitivity to cluster mass 
outside the orbits of the galaxies. A specific situation which leads to this bias is found in N-body 
simulations of cluster buildup in an Q = 1 cosmology with a realistic amount of substructure (West 



fc Richstone 198j , parlberg fc Dubinski 199T| , parlberg 199^ , [Katz fc White 1993| , |van Kampen 



1995 ). In this case the simulated "galaxies" become more concentrated within the cluster than its 



mass. That is, the virial mass calculated from the "galaxies" systematically underestimates the 
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total mass of the cluster in these simulations by as much as order of magnitude. 

There are some observational indications of the possibility of a higher 17 than that calculated 
from virial masses. Detailed mass modeling for the Coma cluster gives a projected mass of 
5.8 X lO^^h^^ Mq within 3° of the center (with a normal mass-to-light ratio, [Kent fc Gunn 1982 ), 



whereas the virial mass is generally taken to be 2.1 x 10^^h~^ Mq for the same region. However, 
this can also be ascribed to the substantial cluster to cluster variations in the shape of the velocity 
dispersion profile ( |den Hartog 1995| ) due to the asphericity and substructure within individual 
clusters. Another indication comes from substructure, which is a complication in the analysis 
of clusters as individuals, but can be turned to advantage as an indirect, model dependent, Q 
estimator. The results of substructure analysis remain controversial: some workers favor ~ 1 
(Richstone, Loeb & Turner 1992, Mohr et al. 1995| ), while others suggest much smaller Q, values 



(ITsai fc Buote 1995| ). 



The CNOC (Canadian Network for Observational Cosmology) cluster redshift survey is 
designed to derive a cluster Oq within a homogenous, self-contained, sample where we can explicitly 



test for systematic errors. The catalogues contain ~2600 redshifts in the fields of 16 clusters (Yee, 
lEhingson &: Carlberg 1996| ) from which [C9^ ) derived an average (My/L) = 289 ± SO/iMq/L© 



(calculated with qq = 0.1). The clusters are consistent with having a universal M^/L value 
(within the errors of about 25%) independent of their velocity dispersion, mean color of their 
galaxies, blue galaxy content, or mean interior density. The field galaxies in the dataset, with the 
same corrections, over the same redshift range, yield the closure mass-to-light ratio, pd to be 
1136 lb 138ft. Mq/Lq. Consequently the virial mass-to-light ratio implies f^o = 0.25 it 0.05 at a 



mean redshift of 0.32, where the error is the formal la random error. In C96 the derived Q was 
erroneously corrected to a redshift zero VLq. The calculation uses a co- moving volume element (go 
dependent) and Hubble constant which are both redshift zero values which do not need further 
corrections. 

Testing the accuracy of the virial mass was a primary consideration in the design of the 
CNOC observations. The data allow us to derive a radially resolved projected number density 
profile, T,n{R), and a projected velocity dispersion, ap{R) (in the following, R is used for projected 
radius; whereas r denotes the spherical co-ordinate). The sample meets the two main requirements 
for deriving a radially resolved mass profile which are, first, accurate control of the background, 
and second, enough clusters to average over the aspherical complications of individual clusters. 
In Section 2 we review the properties of our sample. We define scaling velocities and radii which 
we use in Section 3 to combine all but two "binary" clusters to create an "ensemble cluster". 
This averaging increases the signal-to-noise ratio of our measurements, and is sufficient to justify 
the assumption of an equilibrium, spherical, cluster in the analysis. The clusters are sampled 
to densities in redshift space that are comparable to the field density, requiring a background 
subtraction which is discussed in Section 4. The mean surface density profile of galaxies within 
the cluster is derived in Section 5, and fitted with a spatial density profile. The projected velocity 
dispersion profile is derived in Section 6, and fitted to a radial velocity dispersion function for a 
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variety of orbital shape assumptions. Using these results the mass-to-light profile is derived from 
Jeans equation in Section 7. In Section 8 we examine the relative colors and luminosities of field 
and cluster galaxies as a measurement of the differential evolution of the two populations. The 
corrections and error budget are presented in Section 9, followed by our conclusions and discussion 
in Section 10. 



2. Sample and Observations 

The CNOC sample is designed to create a dataset that allows as complete internal control of 
all aspects of the cluster Q estimate as possible. It is essential that the sample be useful for a test 
of the equilibrium hypothesis and whether the virial mass is biased in some way. Furthermore the 
sample is used to measure differential luminosity evolution between cluster and field galaxies. On 
the basis of some n-body simulation data ([Carlberg et al. 1994| ) it was argued that these could 
be best met within the constraints of available observational resources with a set of a dozen or so 
clusters at z ~ 1/3 with a total of 3000 or so accurate redshifts. 

The cluster sample was chosen from the Einstein Medium Sensitivity Survey Catalogue of 
X-ray clusters ( pioia et al. 1990 , Henry et al. 1992, Gioia & Luppino 1994) to have a high 



X-ray luminosity, L-^ > 4 x 10^^ erg s~^, which helps guarantee that the clusters are reasonably 
virialized and have relatively high masses, making them easier to study. The clusters chosen. 
Table |l], are at moderate redshifts, 0.17 < z < 0.55, which has a number of advantages for mass 
estimation. They are sufficiently distant that they have a significant redshift interval over which 
the density of foreground and background galaxies are nearly uniformly sampled in redshift. At 
z ~ Ys, a typical cluster diameter of 3/i~^ Mpc (comoving) spans an angle of about 12', which is 
sufficiently small that uniformity of photometry and sample selection is relatively easily assured. 
This angle is also comparable to the field size of the Canada-France-Hawaii Telescope (CFHT) 



Multiple-Object-Spectrograph (LeFevre et al. 1994), approximately 10' square. 



Observations were made at CFHT in 24 assigned nights in 1993 and 1994, of which 22 
were usefully clear. The sample and observational techniques are described in detail elsewhere 



(Yee, EUingson & Carlberg 1996). The primary catalogues contain Gunn r magnitudes and 
g — r colors for 25,000 objects, of which about 2600 have velocities, on the average accurate 
to about 100 kms~^. About one-third of the sample is cluster galaxies. The other two-thirds 
are field galaxies, although less than half of them are within the fair sample region defined by 
the band-limiting filter's passband and a Az = 0.01 buffer zone added to the upper and lower 
cluster redshift range. For reasons of observational efficiency, the survey is not "complete" in 
the usual sense. Hence, each object with a redshift has calculated weights which are the inverse 
of the magnitude selection function and the magnitude-dependent geometric selection function 
( Yee, EUingson &: Carlberg 1996| ). There are small changes in the cluster dynamical parameters 



of Table [l| from those in the global analysis ( p96| ). The values here are derived using the final 
survey catalogues which accounts for the small changes from those given in earlier papers. All 
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calculations in this paper assume Hq = 100 km s ^ Mpc ^, Qq = 0.2 and A = 0, although these 
choices do not affect our main conclusions in any significant way. 



3. Dynamical Parameters of the Clusters 

The primary goal of this paper is to measure any systematic biases in M^/L as an estimator 
of the field value. This is done by independently measuring the mass and constraining differential 
luminosity evolution of cluster and field galaxies. We adopt the virial mass, 

= ^alr,, (1) 
as the standard estimator, where cJi and are defined below in Equations ^ and |3|. The 



drawback of the virial mass is that it is a statistically "inefficient" estimator (Bahcall fc Tremaine 



1981 , Beers, Flynn fc Gebhardt 1990D . Its positive features are that it is completely independent 



of the distribution of orbital shapes and it is not unduly sensitive to background contamination. 
An alternate estimator is the projected mass ([Bahcall &: Tremaine 1981 ), which we calculated 



but do not report. Its main drawback is that it is quite sensitive to background contamination 
of the data at large radii. For the standard velocity ellipsoid shape the projected mass is always 
larger than the virial mass, which we find below already overestimates the cluster mass. Hence, 
we adopted the virial mass as our standard estimator. 

The first step in testing the accuracy of My is to construct a low-noise, effectively spherical, 
"ensemble" cluster, which averages over internal substructure and the variation of projected 
quantities with viewing angle. Our clusters span about a factor of two in velocity dispersion and 
hence are not a uniform set of physically identical objects. Simply overlaying all the clusters onto 
a common center of mass in physical velocity and position units has some significant advantages 
(an approach that will be pursued in another paper), but has the disadvantage that any gradients 
of density or velocity dispersion tend to be blurred out. The natural scaling parameters are each 
cluster's RMS velocity dispersion and the characteristic radius at which the cluster is expected to 
be in an effective equilibrium, which turns out to be a function of the RMS velocity dispersion, 
alone, within our approximation. 

Deciding which galaxies in redshift space are cluster members is fundamentally ambiguous. 
That is, a cluster galaxy with a completely reasonable line-of-sight velocity of 1000 kms~^ appears 
in redshift space at 10h~^ Mpc from the cluster's center of mass, far outside the virialized cluster 
and overlaying field galaxies. This complication increases in severity at large distances from the 
cluster center, which our sample is specifically designed to probe. One straightforward solution 
to this problem, which we adopt, is to explicitly subtract the mean density of field galaxies in 
redshift space from the cluster distribution. 



The internal kinetic energy of the cluster is calculated from the characteristic velocity 
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dispersion, ai. We iterate the classical estimator 

^?=(E^^) 5]u;,(A^;,)^ (2) 

where the Avi = c{zi — z)/(l +'z) are the peculiar velocities in the frame of the cluster and z 
is the weighted mean redshift of the cluster. The weights are calculated from the redshift and 



photometric catalogues to allow for statistical incompleteness of sampling ( Yee, Ellingson & 



Carlberg 1996 ). The key to the use of this estimator is to have an objective choice of the galaxies 
that are likely to be cluster members. The adopted method ( p96| ) is as follows. First a choice of 
the cluster redshift range is made from which a trial o"i is calculated. Then all galaxies between 
5 and 15 o"i are used to make a background estimate, which is subtracted from the weights of 
the galaxies selected to be in the cluster. If the ai calculated from the background subtracted 
weights is within the 68% bootstrap confidence range of the trial ai ( [Efron &: Tibshirani 198^ ), 
then the procedure is stopped; otherwise, the redshift limits are increased or decreased to find a 
convergence. 

Precisely the same galaxies as determined to be "in the cluster" from our cji calculation are 
given to the iterated bi- weight estimator ( [Beers, Flynn fc Gebhardt 19901) . The resulting estimates 
of the velocity dispersion are, on the average, 1.073 it 0.019 times larger than from our method. 
We find below that the global mean velocity dispersion of the combined dataset, normalized with 
our 0"! values, is 1.05 it 0.04, whereas it should be unity, which is consistent with either scheme for 
calculating the velocity dispersions. Since the outcome of this investigation is that My is biased 
upwards, the bi-weight calculation would lead to a further increase in the bias. 

For each cluster a "ringwise" estimate of the virial radius of the observed distribution is 



calculated (see C9(: for details): 



where /c?^ = 4i?ji2j/(i?j + ^j)^ and K{k) is the complete elliptic integral of the first kind in 
Legendre's notation ([Press et al. 1992|) . The angular extent of the sample is set by the observed 
field size. The redshift range is as found in the velocity dispersion calculation. 

The goal is that our data will include the entire virialized mass of the cluster. Analytic models 
( Gott fc Gunn 1972[ ) and simulation data ( pole &i Lacey 1996 , Zembrowski & Carlberg 1996) find 



that the virialized mass is generally contained inside the surface where the mean interior density 
is 200/?c at the redshift of the cluster. The mean interior density within our measured r„ is 

p{ry) 1 3M„ 



Pc{z) pc{z) AtttI ■ 
In terms of the measured dynamical parameters this is 

'p{rv) _ 6o-f 
Pc{z) H^{z)r^' 



(4) 



(5) 
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where the Hubble constant at z is ^^(z) = iJ^[r2o(l + zf + ^^(l + zf + I^a] ( Peebles 1993| ). 



Normally we take r^o = 0.2, I^a = and = 1 — l^o for an open FRW model. The mean 
density inside r^, serves as a check as to whether the radial extent of the cluster is sufficiently 
sampled to determine a virial radius that includes most of the virialized mass. Table Q gives the 
values for r^,, cji, and 'p{rv) / Pc{z), in columns 3, 4 and 5, respectively. The sampling over-density 
7>{^v) / Pc{z) varies substantially from cluster to cluster, which mainly indicates the radial extent of 
our coverage of a given cluster. The observations used here extend beyond r2oo for rnost clusters. 
The exceptions are MS1006+12, MS1008-12, and MS1455+22 where the data extend to about 
2/3 to 3/4 of raoo- 

The cluster's virial mass to k-corrected Gunn r luminosity, M^/L, and its errors are given in 
columns 6 and 7 of Table |l|. The numbers are slightly different than those found in C96| because 
these results are based on the finalized catalogues. 

To calculate raooj the radius where the mean interior density is 200pc{z), requires an 
assumption as to how the mass is distributed. One possibility is that the mass is strongly 
concentrated to the center of the cluster such that all the mass measured by My is inside the 
measured Vy, in which case, 



^200 



(6) 



2mpc{z 

This definition of raoo is not particularly appropriate when ]}/ pc{z) < 200, as it is for our best 
sampled clusters, because the cluster galaxies, and presumably the mass as well, extend beyond 
r2oo- A superior r2oo is calculated assuming a density-radius model which is extrapolated to the 
desired overdensity. Simplicity, and a great deal of work on clusters of galaxies and the properties 
of dark halos in general, suggests that the first approximation for the density-radius relation is the 
singular isothermal sphere, p oc r~^, in which case we define, 

p{rv) ^^^'^ 



r200 

or equivalently. 



200pc{z) 



(7) 



= lo^- 

2/3 1 /3 

Note that rggo cfi fy and raoo oc ai. That is, raoo is independent of the angular extent of 
the sampling, in so far as the cluster's ai value has no radial dependence, which for the shallow 
gradients of the velocity dispersion profile found below is effectively true. At z = V3 a low density, 
flat universe gives a value of raoo about 13% larger than in a open universe. 

Values of rggo and r2oo are given in columns 8 and 10 of Table |l] in units of physical h^^ Mpc 
for our cluster sample. For our sample these two r2oo estimates have very similar values, so the 
results are not sensitive to the mass model. The jacknife error estimates for rgpo and raoo in 

Mpc are given in columns 9 and 11, respectively. All of the following analysis will use raoo as 
the scaling radius. 
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Two of the clusters, MS0906+11 and MS1358+62, are strong binaries, the first having an ill 
determined velocity dispersion and the second having a massive substructure on one side moving 
at 1000 kms~^ relative to the cluster center. These are dropped from all further use in this paper. 
The scaled velocity-radius data for the remaining clusters are displayed in Figure |l] plotted using 
the brightest cluster galaxies (BCGs) as the projected center and the weighted average redshift as 
the velocity center. 



4. Background Subtraction 

All of the cluster properties presented in this paper are measured using galaxies with redshifts. 
The clusters typically extend over a velocity range of 5000 km s~^, which in redshift space causes 
orbitting cluster galaxies to be intermingled with field galaxies within a 50h~^ Mpc distance 
interval around the cluster. Background and foreground galaxies that are projected into the 
redshift space of the cluster must be statistically removed. This survey was deliberately placed 
at moderate redshift to give a field sample over a considerable redshift column from which the 
background density can be accurately measured. The background density is measured in the 
normalized velocity-radius space of Figure ||. It should be noted that the sampling in Figure || 
varies substantially with both velocity and radius which is addressed with redshift cutoffs and 
magnitude and position dependent weights ( |Yee, Ellingson &: Carlberg 199(| ). The background 
and foreground are folded on top of one another within the unbiased redshift sample range of the 
band limiting filters which removes any linear gradients present in the redshift distribution. 

There are two approaches to background subtraction used in the following calculations. For 
the surface density calculation we use the straightforward average over the sky area observed. 
The velocity dispersion calculation takes a more cautious approach to background subtraction. A 
resolved background is derived in precisely the same range of projected radii for which ap{R) is 
calculated. The advantage of this approach is that the contamination of the cluster velocity profile 
with various nearby groups of galaxies is subtracted in the range of projected radii where they 
contribute to the background. Furthermore, the ap{R) calculated from non-overlapping radial 
ranges are completely independent of each other. 



5. The Average Density Profile 

The surface number density profile of the average cluster, Ti]\}{R), is the projection over 
velocity onto the radius axis, i?, of the data of Figure |l[ To find T,iy{R), we sum the statistical 
weights (which correct for nonuniform sampling, Yee, Ellingson fc Carlberg 1996| ) for objects 
within ScJi of zero velocity, divide by the bin area, and subtract the mean background. We prefer 
the BCGs as the projected centers ( Bird 199^ ), but repeat most of the analysis using the peak of 
the smoothed density profile as the centers, which gives a useful view of systematic effects in the 
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analysis. 

The mean background volume density in velocity space, pi, is calculated by summing over 
the entire range of R/r2oo using the background in the velocity range of 5 to 25 ai. We find 
that pj^ = 0.0065 ± 0.0008, in units of the normalized total numbers per ai per r2oo^- Figure ^ 
shows the variation of the average background density with velocity separation (in normalized 
units) from the cluster. Between 3 and Gdi, the background calculated from the full sample (left 
panel) is higher than expected for a high velocity tail like a Gaussian. This excess is statistically 
significant at about the 3 standard deviation level, and results from various small groups that 
happen to be behind the low redshift clusters. The right panel of Figure § shows the background 
calculated after the 7 clusters with z < 0.27 are dropped. The rapid decline of clustering with 
redshift leads to a smoother distribution in redshift in the right panel. The mean background 
is calculated over the entire 5 to 25a"i interval for the full sample, so a local excess makes little 
difference. As a sidelight we note that near the cluster center, l-3a"i, the velocity distribution is 
possibly somewhat "squarer" than a Gaussian distribution. That is, for a Gaussian the ratio of the 
numbers between 1 and 2ai and to lo"! is about 1/3, whereas we have more than 1/2. This is 
likely an indicator that the cluster velocity ellipsoids are somewhat radially anisotropic (Dubinski 
1993i iCrone, Evrard fc Richstone 1994| ). 



To check the sensitivity of the derived T,]\f{R) to the assumed background level we repeat 
the calculation using values for pi of 0.0048 and 0.0082, which are 2 standard deviations from 
the mean value. The only part of the profile substantially affected is beyond r2oo, with alternate 
values being just within the 1 standard deviation range of the estimated density at that point. 
That is, for our adopted bin widths the statistical error due to sample size generally exceeds 
the uncertainty in the background subtraction. The fitted profiles described below have scale 
radii altered by 1.5% for a 26% variation of the background. We conclude that the background 
subtraction is not a dominant source of error for T,j\f{R). 

The background subtracted T,j\f{R) are displayed in Figure ^ and tabulated in Table ^. The 
vertical normalization of T,j\[{R) is a fixed, but arbitrary, value. In Table |2| the number density Eg 
(column 2) is measured about the BCG, and Sp (column 4) is measured about the peak of the 
45" Gaussian smoothed density map. Within 0.1r2oo of the center the density profile is strongly 
dependent on the choice of center. Because there is usually no galaxy at the point of peak density, 
there is a slight dip in the central surface density when measured about the peak of the smoothed 
density distribution. The choice of center makes little difference to the measured T,n{R) at radii 
beyond 0.1r200) approximately WOh~^ kpc, but it does affect fits to a specific model. The point of 
peak density generally has sufficiently large errors that it could be consistent with being at the 
BCG, hence we always prefer the BCG center for physical reasons ( pird 19941) . Outside of 2r2oo 



the surface density is comparable to the background but the data are very sparse, which together 
cause the surface density at large radius to be quite uncertain. No data beyond 2r2oo ai"e used 
in the velocity dispersion analysis. The surface luminosity density profile was also derived; but, 
being weighted to the brightest galaxies, it is a noisier quantity than the number density. We find 
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that the luminosity density profile shape is essentially the same as the density distribution. 

The spatial galaxy number density as a function of radius, z^(r), is needed to compare the 
galaxy number profile with the cluster mass profile. The relation between the volume density and 
the surface density is, 



E^(i2) = 2/ z.(r)-^=_dr. (9) 



R 



A statistically adequate, analytically convenient, model that describes these data is the Hernquist 



(alternatively designated as an = 2) model (Hernquist 199C, Tremaine et al. 1994). We fit the 



data centered on the BCG of Figure ^ to the projection of the volume density model, 

^^(0 = , ^ (10) 
^ ' r(r + o)3 ^ ' 

This procedure for finding a u{r) consistent with the Tii^{R) data is not unique, there being both 
the minor variation of using one of several other entirely reasonable fitting functions, or, adopting a 
completely different approach such as the non-parametric, maximum likelihood method ( |Merritt"fc| 
piVemblay 1994 ) modified to allow for background subtraction. The function v{r) = Ar~'^{r + a)~^ 



(Navarro, Frenk & White 1995) is also a statistically acceptable fit with a slightly increased \^ . 



Our measured T,n{R) falls faster than this function predicts at large R, but at these radii our 
measurement is relatively uncertain. 

Fitting the surface density data with Equation |l^ finds that a = (0.66 it 0.09)r2oo for the 
clusters centered on their BCGs. The reduced chi-squared statistic, /uf, is 0.74 for rij = 17 
degrees of freedom, which is an acceptable fit. Using the T,n{R) measured about the peak light 
gives a considerably larger scale radius, a = (0.97± 0.10)r200) with /nj = 1.01. This larger value 
of a is at least in part due to a blurring of the density profile by statistical errors in measuring the 
position of peak number density. Although we believe it is a less appropriate choice for the cluster 
centers for these data, we carry it along in the analysis as an indicator of systematic errors. 



6. The Velocity Dispersion Profile 

The measurement of (Tp{R) is a second moment of the velocity distribution about the local 
mean velocity at projected radius ii, although the choice of central moment makes essentially 
no difference to the result. The projected velocity dispersion at R, ap{R), is calculated using 
an iterative clipping of the high velocity dispersion tails with a technique similar to that used 
for the RMS velocity dispersion in P96| , where ai is calculated from the background subtracted 
data within riccr of zero velocity, where ric = 3. The iteration begins with ap{R) = 1 in the ai 
normalized velocities. For the next four iterations the (Tp{R) from the previous iteration is used 
for clipping. For a 10% change in the clipping level, ric, the resulting velocity dispersion changes 
about 1% at large radii. The error flags are calculated from a full bootstrap resampling of both 
the cluster and fleld data. 
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Rather than radiahy binning the data, we use a moving average, which includes all galaxies, 
both cluster and field. Table |3| gives the results with a 51 point moving average which are plotted 
in Figure ^ at intervals of 0.1 in r2oo- These points are always independent for 51 point averaging. 
Averages using 31 to 201 points give consistent results, the 31 point result being quite noisy, and 
the 201 point beginning to smooth the velocity dispersion gradient. The results of a 101 point 
moving average are shown in Figure |5[ For 101 point averaging the error flags are not independent 
outside of 0.4r2oo- 

The random errors of the derived (Jp{R) of the ensemble are dominated by cluster to cluster 
fluctuations. Within an individual cluster (for instance A2390) the random errors are substantially 



smaller, but there are large differences in velocity dispersion profiles from cluster to cluster ( |den| 
Hartog 1995| ) which we attribute to projection effects and substructure. At large radii, dynamical 



measurements from a single cluster likely do not give reliable enclosed masses because of the 
complications of infall, triaxiality and substructure. The idea of creating the ensemble cluster 
is of course to diminish these systematic variations to the level where an effectively spherically 
symmetric average profile emerges. 

To derive a (T.f{r) which is consistent with the observed (Jp{R) we use the same approach as 
we used for the density profile. That is, we choose a reasonable functional form, then adjust 2 
parameters, a scale length and a normalization, until its calculated projection, 

poo d2 „ 

ap\R)^N{R)= u{r)al{l-f3^)^==dr. (11) 



R 



r^Wr^-R^ 



minimizes the x^- this equation P = 1 — cr'l/a'l is the velocity anisotropy parameter. For the 
purposes of this paper (3 will be taken to be a constant, although we do use a range of values. 

The adopted (Tr{r) function should be finite at the origin, at large r it should tend to a 
Keplerian r^^/^ for a convergent mass distribution, and it should be a simple smooth function 
in between. It is important that this function not assume that the galaxy populations are 
self-consistent with the mass density of the potential. We adopt the simple form, 

^'(0 = (12) 

for the radial velocity dispersion. This function, with B= Y4 and assuming 6 = a, is the solution 
of the Jeans Equation (Equation |l3| below) for (3 = Y2 and A = l/(27r) in our assumed v{r). The 
scale lengths h and a are fitted separately, usually finding that h is larger than a. 

The nonlinear fitting procedure requires that the errors be symmetric about the observed 
value, so we take the larger of the upper and lower error range at each point. The fit is done 
assuming (3 = 0,0.25,0.5 and 0.75. The projected fit and the 51 point data are shown in Figure 
The /nf of the two parameter fit, an amplitude and length scale, is 0.48 for all j3 values. 
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7. Mass-to-Light Profiles 



To compare the relative distribution of mass and light we integrate our fitted u{r) to give 
-L(r), the "light profile" (actually a number density profile), which is converted to a mass profile 
using a normalized global M^/L, calculated in the same manner as for the individual clusters. 
Then we compare the mass predicted by the light-traces-mass assumption to the M{r) derived 
from the fitted cJ,,(r) and i^(r) using Jeans Equation, 



M{r) 



air 



d In dinv 

-TT^ + -IT- + 2/? 

amr amr 



(13) 



We will usually refer to the mass derived from this equation as MsHoif)-, the stellar hydrodynamical 
mass. 

It must be borne in mind that Equation |l^ is a moment of the Collisionless Boltzmann 
Equation, and will not necessarily give a physical result for all /?, being particularly susceptible 
to failure for very radial orbits (Binney & Tremaine 1987, plernquist 199C| ). In addition, radial 
models are usually dynamically unstable to the radial orbit instability ( Barnes, Goodman fc Hutl 
1986 , Palmer &: Papaloizou 19*8^ ). However, the galaxies are only tracers, so in principle they 
could be on much more radial orbits than the underlying dark matter. On the other hand, it 
seems likely that galaxies orbiting past the center may be tidally destroyed or simply well mixed 
so that the galaxy velocity ellipsoid near the center may become less radial than the dark matter. 



7.1. Dynamical Modeling Details 



Equation |1^ is integrated with the boundary condition that cTj. ^ as r ^ oo. Alternative 
boundary conditions are that the velocity dispersion goes to zero at the turnaround radius 
( Gott &: Gunn 1972| ), or, that there is an outer region of nearly radial infall which makes a 
transition to virialized motion ( den Hartog 1995| ). N-body simulations {e.g. Crone, Evrard & 



Richstone 1994, Cole &: Lacey 1996| , Zembrowski fc Carlberg 1996| ) show that on the average, 
the approximation that the virialized motion continues to r2oo is a good one. For constant (5 



Equation 13 has a formal solution. 



2, , GM{x)v{x)x^'^'^~'^Ux 



<{r) 



/y(r)r^^ 



(14) 



Satisfying the Jeans equation is a necessary condition for an equilibrium to exist, but it is not 
sufficient and is certainly not a guarantee that a spherical distribution is dynamically stable. 
Models with a constant /3 ^ ^/a are unphysical, because the implied mass is negative in the inner 
regions (very radial orbits lead to u{r) oc r^^, not the r^"^ here). This, along with the results 
of numerical simulations, leads to our preference for < /5 < The model lines plotted on 
Figure ^ are for /3 = 0, Y2 and 1, assuming M{r) oc L(r) in Eq. 14, using the fitted u{r) from 
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Eq. 10, and normalizing the projected RMS velocity dispersion over the observed radial range to 



the observed value. 

The numerical complication in deriving M(r) from Equation |l^ is that the logarithmic 
gradients dlnir^/dlnr and dlnu/dlnr must be calculated. Numerically evaluating these gradients 
from the data requires some form of smoothing to be stable, which our fits to smooth functions 
effectively accomplish. We will carry the errors of the projected ap data through the analysis to 
help give an assessment of the result. 



7.2. Velocity Ellipsoid Anisotropy 



The radial dependence of the shape of the velocity ellipsoid, as defined by the value of (3, is 
not empirically known for this sample, although this does not introduce the same uncertainty as 
it does for finding central dark masses. As the velocity ellipsoid becomes more and more radial at 
large radii, the mass implied for a measured ap increases. The inferred mass has nearly a full order 
of magnitude uncertainty when a velocity dispersion is available at only a single point with no 
knowledge of the shape of the velocity ellipsoid (Merritt 1984, [Richstone Sz Tremaine 1984 , The 



fc White 1986| ). Because we have measured the velocity dispersion over a wide radial range the 
uncertainty due to velocity dispersion anisotropy becomes quite small, as shown below. 



N-body simulations of objects that form via collapse find that /? > ( Richstone &: Tremaine) 



1984), although it is clear now that "vacuum" boundary conditions give far more velocity 



anisotropy than is realistic in a cosmological setting ( Dubinski 1993| ) where substantial external 
torques provide angular momentum to large radius orbits. Furthermore, strongly radial models 
are unstable ( [Barnes, Goodman &: Hut 1986 , Palmer &: Papaloizou 198^ ), although galaxies, being 
nearly massless within a cluster, could have a different velocity ellipsoid than the dark matter. 
For the nearly power law hierarchical clustering that characterizes the growth of clusters in model 
universes one expects that the velocity ellipsoid will have only a weak radial dependence, because 
cluster buildup is self-similar. This is borne out in n-body simulations ( Dubinski 1993 , [Crone, 



Evrard & Richstone 1994, ^embrowski fc Carlberg 1996 ) which find < /3 < 0.5 over the radial 
range of interest for a wide range of f^o and clustering models. The depletion of HI in Coma 
spirals suggests ( [Pryor fc Geller 1984[ ) that their velocity ellipsoid is not strongly radial, and is 
consistent with being isotropic. 



7.3. The Average Mass-to-Light Ratio within r2oo 

A test of the accuracy of the virial mass as an estimator of the total mass virialized in 
the cluster is whether the product of the virial mass-to-light ratio computed from our average 
cluster and the integrated interior luminosity, L(r), is equal to the stellar hydrodynamical mass. 
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MsHDir), derived from Eq. That is, we derive the virial mass bias, bMv{f), from 

, , . MsHpjr) L 

bM.ir) = -^J^^j (15) 

where L[r) is simply the volume integral of v[r). The normalization of L{r), L, is arbitrary since it 
cancels in Eq. |l5|. The normalizing virial mass, My, is derived for the ensemble cluster in the same 
way as for the individual clusters. Two aspects of bMv{f) are of primary interest. First, 6Aft)(?'20o) 
should be unity if the M^jL calculated from the average cluster is an unbiased estimator for all 
material inside r200) the radius expected to contain most of the virialized mass. Second, the radial 
gradient of huviy^ measures to what degree light traces mass in the average cluster. The first issue 
is the one of most relevance for the problem of estimation. It should be noted that whatever 
the value of bMv{^) turns out to be here, it is a quantity that is meant to give the systematic 
error in measuring cluster mass using red selected galaxies in rich, reasonably relaxed clusters. It 
would be inappropriate to apply the same factor to merging clusters or a galaxy sample selected 
to emphasize high star formation rates, for example. 

The galaxy number density profile, which we will call the light profile, is normalized to a mass 
using a mass-to-light ratio calculated in the same manner as done for our clusters individually. 
There are two complications in this calculation which need to be examined. First, the clusters are 
not all sampled to the same fraction of r2oo- Second, both the and the total light are calculated 
without any allowance for background contamination. To investigate these effects we calculate 
My/L using the mean profile, both with and without background subtraction, limiting the data at 
various Rmax/ 1^200 ■ The results are shown in Figure ^. 

Remarkably, the ratio M^/L is almost entirely independent of background contamination and 
sampling radius, once i2max/^200 ^ 0-5- The lower line on the figure is made using the surface 
density without background subtraction, which causes f„, hence My, to be overestimated, but 
the summed luminosity is also overestimated. The pleasing result for this sample is that the two 
overestimates almost exactly cancel out so that My/L calculated in the usual manner has no 
sensitivity to background. Furthermore, My/L has very little dependence on the sky coverage 
of the cluster, provided that the data extend beyond 0.5r2oo- Figure ^ shows that the directly 
calculated average My/L is high in the inner regions, but then drops to a very stable value at 
large radius. The normalizing virial mass is My = Saffy, where G = 1 and the parameters are 
the measured di and f„ for the combined cluster data. We use the quantities for the full sample, 
with background subtraction, although this choice makes little difference to the results. With this 
normalization, 6mi;('^20o) should be unity if My is unbiased. 



7.3.1. Virial Mass Bias 



The internal My/L bias, bMv{f), from Eq. is displayed as a function of radius in Figure ^ 
for /3 = 0, 0.25, 0.5 and 0.75. The bias would be unity everywhere if the product of My/L and L{r) 
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was equal to the stellar hydrodynamical mass. Note that if /3 ^ 2/3 the implied mass is negative 
at small radius. 

The primary goal of this paper is to correct the virial mass-to-light ratio to the field M/L 
value. The average virial mass bias for the bulk of the virialized system estimates the systematic 
offset in the virial mass, and is reasonably estimated as the value of hMv evaluated at r200i 
displayed in Figure |8| as a function of the assumed j3. Because the gradient is small the exact 
radius used does not make a big difference. The error in 6m'u(''20o) is calculated from the error in 
the fit to the B coefficient in o",.(r), the error in the other quantities being negligible in comparison. 
The 3 symbols are for absolute mag nitude cutoffs of =-18.5, -19.0, and -19.5 mag, respectively, 
of the entire calculation. The points offset in /3, with the dashed error flags, use the peak density 
as the centers and are always poorer fits, hence have larger errors. These points are included to 
illustrate the small systematic differences. All the results are statistically identical. 

We find M^, as normally calculated, is always an overestimate of the mass, whose value is 
weakly dependent on the assumed (3 value. For this sample the bias is minimal {i.e. hMv closest 
to unity) near /? < V2 for which it is equal to 0.82 it 0.14. All (3 give statistically consistent bias 
values, although there is clear trend of decreasing bMv with increasing [3. 



7.3.2. The Mass-to-Light Ratio Gradient 

The inferred radial gradients of bMv for different P are shown in Figure ^. The gradient is 
calculated from the slope of the lines in Figure |^. Only for very radial orbits, /5 > ^/s, is there 
even a mildly significant rising mass-to- light ratio with radius, and those are the models with 
negative central mass. The gradients are sufficiently small that for a wide range of /? they are 
consistent with there being no gradient of M/L. In no case are the gradients sufficiently large that 
there is any possibility that the cluster has a dark matter halo that is significantly more extended 
than the visible galaxies. 



7.4. A Model Calculation of the Virial Mass Bias 

We have found that the virial mass, as we have calculated from the data, is about a 20% 
overestimate of the true mass. The origin of this factor needs to be quantitatively understood. 
We believe that this bias is simply the approximation that the surface pressure term in the 
virial equation is zero. For our fitted functional forms for the density profile and the velocity 



dispersion profile, (Equations 10 and|T^, respectively), we can provide an estimate of the expected 
"truncation bias". 

Truncating a density profile of the form Eq. |l^, p{r) = Ma{2TTr)^^{r + a)~^, will cause the 
true mass contained within that radius to be overestimated, as quantified below. The potential 
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energy scalar, W{x), of the density distribution truncated at x is, 



. r ^ , GM^x^ (x + 4a) 

An pM{r)rdr= / ^ (16) 

JO 6a + a) 

Calculating the virial radius as — M (yX")^ /W (yX) we find that 

The estimated virial radius is 6a if the distribution is sampled to infinity distribution, whereas for 
our combined clusters we have measured the virial radius to be 3.9a (using the a found from the 
fit of the distribution with background subtraction). This is consistent, within the errors, with our 
truncation radius a; ~ 4a and Eq. 17 for the combined sample. The clusters as individuals all have 
truncation radii less than or equal to this. 

If the galaxy distribution follows the mass density the mass contained within x is estimated 
from the "classical" virial theorem relating the kinetic energy, T(x), W{x), and the surface term, 
3PV{x), through 2T{x) + Wix) = 3PV{x), as M^(< x) = 2G-'^T{x)r^{x)/M{x). The usual 
assumption for dynamical systems is that 3PV{x) is evaluated as a; ^ oo, where it is zero. This is 
not correct in detail for clusters of galaxies, where this term is a significant correction. 

For the particular case of /3 = ^/2, it was noted above that the solution of Jeans Equation is 
cr^(r) = GM/[4{a + r)]. Hence the truncated kinetic energy, T(x), is, 

r / N 2/ N 2 , GMx^ (x + 3a) , , 

Jo 12a [x + a) 

For the case of the mass being distributed like the galaxies, the ratio of the estimated mass 
contained within x to the true mass contained is, 

My{< x) ^ {x + 3a) (x + a) 
M(x) ~ x{x + 4a) ' ^ ^ 



Note that this expression could also be derived from 2T + W = 3PV, as discussed in p96| . At any 
finite radius this estimated mass is greater than the true global value. At x = a, 2a, 3a and 4a it 
evaluates to 8/5, 5/4, 8/7 and 35/32, respectively. That is, the mass estimated from the averaged 
distribution is expected to overestimate the true value by about 10 to 20%. Clusters that are not 
sampled to large radii will have a larger truncation bias but smaller background contamination in 
the projected quantities. The data in most of our clusters extend to 1 to 2 r200) roughly consistent 
with the measured bMv 



8. The Radial Gradient of Galaxy Populations 



A crucial step in the derivation of 0, from cluster M/L values is to correct for any differential 
luminosity evolution between the cluster and field galaxies. The luminosities of galaxies per unit 
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total mass can, in principle, either increase or decrease depending on what happens when galaxies 
fall into a cluster. The empirical evidence is that once galaxies have spent some time in rich, high 
X-ray luminosity clusters like those in our sample, star formation largely ceases. However, there 
are at least two routes to this end point for a gas rich disk galaxy. Galaxies entering the cluster 
could have their gas largely stripped away, or, a burst of star formation upon cluster entry could 
boost luminosity prior to depleting the gas. In either of these cases, lowering the star formation 
rate leads to a decrease in a galaxy's luminosity by an amount that depends on its entire star 
formation history. Galaxy merging always decreases numbers, but can lead to luminosity increases 
if accompanied by star formation. All these possibilities are testable in one way or another, and 
some substantial constraints on differential evolution can be imposed using the data at hand. 

It is well known that the fraction of cluster galaxies with blue colors is a strongly increasing 
function of redshift, the Butcher-Oemler effect ( [Butcher fc Oemler 1984 , Yee et al. 1995). It 



should be noted that cluster galaxies are, on the average, never bluer than field galaxies, implying 
that, on the average, star formation always declines into a cluster. If the infall of field galaxies 
into clusters induces a temporary increase star formation and hence increases cluster galaxy 
luminosities relative to the field, then it should be apparent as a change in the color distribution 



(and the strength of the [Oil] lines). This is not seen in our data {e.g. Abraham et al. 1996 ). 
On the basis of these considerations we expect that cluster galaxies are of intrinsically lower 
luminosities than the field galaxies. 

Our sample of galaxies extends from the cluster center to the distant field, allowing population 
gradients to be examined as a function of a redshift space radial variable, s. A convenient 
definition of such a variable is based on our normalized co-ordinates, where velocity differences are 
scaled to ai and projected radial co-ordinates are scaled to r200; so that the dimensionless redshift 
space separation from the center is = i?^/r2oo^ + where R is the projected radius 

from the center and At; is the rest-frame velocity difference from the average redshift. Note that at 
large s the galaxies come exclusively from the redshift direction. Figure ^ shows the color-radius 
relation of the full sample. All galaxies with k-corrected absolute r magnitude, M/^, brighter than 
— 19 mag are plotted. The colors are corrected to an empirically normalized redshift-independent 
color, 

(g - r)z = {g- r)[1.2 + 2.33(z - 0.3)]-^ (20) 



(|C96D . We note that there is no color gradient in the field, but one appears at s ~ 2, where we 



expect galaxies to enter the virialized region of the cluster. There is no evidence in the colors for 
excess star formation at the edge of the cluster; indeed, there are no cluster galaxies systematically 
bluer than field galaxies. Hence, the distribution of galaxy color with radius (Fig. |l^) is consistent 
with infalling galaxies terminating their star formation with little or no "starbursting" once they 
enter the cluster, as detailed analysis of the CNOC data for A2390 has demonstrated ( [Abraham 
et al. 1996[ ). 



A sample extending to a larger projected radius is needed to directly demonstrate kinematic 
evidence for infall of galaxies into the clusters. However, given the gravitational mass of the cluster. 
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the turnaround radius is straightforwardly derived ( pott fc Gunn 1972| , Kent fc Gunn 198^ ) such 
that field galaxies must be falling into the cluster out to the turnaround radius, approximately 
4.6r2oo for the model profile here. To a significant degree the cluster must be composed of former 
field galaxies. In such an infall scenario the cluster galaxies statistically share the star formation 
history of the field, prior to cluster entry. 

The only gross difference between clusters and field is that clusters contain an unusual 
population of luminous central galaxies. The BCGs only make a difference at small s, and 
generally contribute well less than 10% of the light. In spite of the substantial radial color 



gradient of Figure |10| there is very little change in the r band luminosity distribution of Figure |ll 



Measurements of the surface brightness evolution of both disks and bulges arrives at the conclusion 
that the amount of differential evolution between the field and the cluster luminosities is relatively 
smaU ( Pchade et al. 1996a| , |1996b|) . 

There is no significant difference in the field galaxy luminosities or colors at any radius 
beyond 2r200i as shown in Figure 11. That is, the field galaxies near the cluster appear to be 
indistinguishable from those far away. A generic prediction of "natural bias" models of cluster 
galaxy formation is that the galaxies that eventually fall into the cluster should form earlier 
because the overdensity of the incipient cluster speeds up the local time scale relative to the mean 
universe. Possibly this leads to a higher rate of conversion of gas to stars, causing cluster galaxies 
to be more luminous than their counterparts in the field. The color gradient appears at about 
2^200) approximately where one expects cluster X-ray gas to be encountered. 

Any fading or brightening of galaxies as they fall into the cluster cannot be large. Without 
the BCGs we measure a mean in the average cluster, r < r200) which is statistically identical 
to the field. However, this averages together blue and red galaxies, which contribute differently to 
the field and cluster, masking a real fading. Splitting at {g — r)z = 0.7 (Eq. ^) finds that both 
the blue and red galaxies show a decline in their mean brightness, above our limit, of 0.11 mag 
(±0.05 for red, ±0.07 for blue) between field and cluster. Splitting the sample into a low and high 
redshift sample shows that the fading between red galaxies in cluster and field increases to about 
0.2 mag at low redshift. A full luminosity function fit (H. Lin, private communication) finds a 0.2 
mag field to cluster fading, but with large errors correlated with the faint end slope. In any case 
there is no evidence within these data for an excess in stellar luminosity per unit dark matter in 
cluster galaxies over field galaxies. The corrected r band mass-to-light ratio in the field and these 
clusters does not vary much simply because there is so much "old" red light which is not altered 
during cluster infall. 



9. Corrections and Error Analysis 



The estimate ( |C96D from cluster virial masses and the conversion of the cluster luminosity 
(or numbers of galaxies) to an equivalent co-moving volume in the field depends critically on the 
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accuracy of the virial mass and the average luminosity of field and cluster being identical or having 
a measurable differential evolution. The expression for $7 that we used as our best estimator is 
Q, = {{My/L))/{pc/j). The field luminosity was estimated to be pc/j = 1136 ± ISS/iMq/Lq where 
the errors were determined from a Jacknife analysis. The value and random errors of {My/L) were 
estimated as 289 it 50/i M0 / Lq . From these results we concluded that the virial mass estimator 
gives = 0.25 it 0.05. However, this result needs to be corrected for errors in M^/L relative to 
the field. 

We find that the virial mass needs to be multiplied by 6mv('^20o) = 0.82 it 0.14 to give the 
SHD mass at r2oo- The correction is a consequence of measuring the virial mass from a truncated 
distribution and using data beyond r2oo- The values of buv for — 1</3<3/4 are all within the 
errors of this value. We also find a significant mean luminosity differential between field and 
cluster which requires that the cluster luminosity be boosted by 0.11 it 0.05 mag to give the 
field value. Together these corrections reduce M^/L to a field M/L = 213 it 59/iMq/L0 and 
hence VLq = 0.19 it 0.06. The errors in the various quantities are given in Table along with 
their quadrature and linear sum. The main errors in our result are simply the statistics of the 
luminosity density and the average cluster M^/L values. The systematic errors are estimated from 
the extreme versions of the analysis that we find for alternative f3 values, choice of center, and 
color and redshift samples of the galaxies. To obtain an value accurate to about 10% would 
require a sample approximately four times larger. 

The total random error, 30%, is estimated from the quadrature sum of the individual errors. 
Some of the component errors could be partially correlated which would increase the total error 
somewhat. The worst case possibility is if all the errors are completely correlated, in which case 
the error is the linear sum of the errors, 73%. This must be a substantial overestimate because 
many of the errors come from completely uncorrelated sources. 

The systematic errors in Table |^ are harder to estimate. Other choices of density centers 
and fi values give answers that allow error estimation for these parameter variations. The least 
satisfactory situation is the estimate of fading between cluster and field. Our observations contain 
no evidence for excess star formation between cluster and field, so we feel that an increase in the 
average light can be ruled out. Similarly, fading of much more than 0.5 mag in our r band light 
would lead to such gross differences between cluster and field that it would be immediately visible. 
Furthermore, for such a large fading to occur would likely require a stellar population weighted 
towards short lived luminous stars. To constrain further the differential luminosity evolution 
requires more detailed observations of the individual galaxies. 

10. Discussion and Conclusions 

The overall goal of the CNOC survey is to use clusters of galaxies to derive a value of VLq with 
a well determined error budget with particular emphasis on eliminating systematic errors. The 
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major innovations of our analysis are that it is completely self-contained, with most assumptions 
being testable, and that the error estimates are derived from the data themselves. The dominant 
source of error is random cluster-to-cluster variations due to projection and substructure, rather 
than the internal error from individual clusters. The global mass-to-light ratio (in our photometric 
system) of our sample clusters is constant within our typical errors of 20-30% at a value of 
289 ± 5O/1M0/LQ. Over the same redshift range we measure the closure value, pc/j to be 
1136 lb I38/1M0/L0 ( |C96D . We have shown that at r2oo that the ratio of the dynamical mass to 
the cluster light, 6mv('^20o) = 0.82 it 0.14, with essentially no dependence on assumptions about 
velocity anisotropy. After allowing for the fading of cluster galaxies relative to the field, 0.11 ±0.05 
mag, and the bias in estimating we estimate that M/L field is 213 it SD/iMq/Lq, implying 
ilo = 0.19 lb 0.06. The objectively evaluated random errors and estimated systematic errors are 
given in Table ^ 

A strength of these data is that they cover the entire virialized cluster, extending out to 
where the overdensities are low, about \hpc{z), and hence reach to sufficiently large radii that the 
cluster M/L should represent the M/L of the infalling material. If segregation of dark matter 
mass and galaxies somehow occurs during infall, prior to to virialization in the cluster, then that 
could produce an artificially low M/L. We find that the galaxy population outside the cluster has 
no measurable gradients. A benefit of this redshift range is that the galaxy populations in clusters 
more strongly resemble the field than they do at low redshift, tightening the M/L argument. 



Our r^o estimate is similar to the Least Action Principle result of VLq = 0.17 it 0.10 ( |Shaya, 



Peebles fc TuUy 1995| ) which probes larger scales which have much smaller density contrast. Our 



result is inconsistent with an ^Iq of unity in any "cold" matter (less than about 1000 km s^^) 
that falls into clusters. Furthermore the result is well below the 6 = 1 interpretation of the large 
scale streaming velocity results ( pekel 1994 , Strauss & Willick 1995, Fisher et al. 1994, Dekel 



et al. 1993| ). In particular our result in combination with the Cosmic Virial Theorem estimates 



of 0,0 /b ~ 0.2 ( Davis fc Peebles 1983| , Bean et al. 198^ ) implying that these galaxies are 



relatively unbiased mass tracers. Because clusters assemble their mass from regions approximately 
20h~^ Mpc across, the difference between the streaming results and our determination is 
substantial. That is, there is little room for extra matter which supports density perturbations on 
larger scales but does not enter clusters over our range of observed redshifts. 

The we derive is a reasonably comfortable 3 standard deviations above the upper end 
of the fraction universe in baryons, $7?,, ( [Walker et al. 1991 , |Copi, Schramm fc Turner 1995| ), 



as the X-ray mass measurements also indicate ( [White et al. 1993| , White fc Fabian 1995[ ). The 
X-ray analyses for the ratio of the X-ray gas mass, Mx, to the total mass, Mf, find M^/Mt ~ 0.09 
(approximately adjusted to /i = 1, but see [White fc Fabian 1995[ ). In combination with our results 
for r^o this line of reasoning gives 

Qb = 0.02/i3/2(o.2/S7o)(0.1/(iV4/Mt)) with an error of about 
35%. Consequently cluster baryons are consistent with other BBN indicators for /i ~ 0.7, although 
with a fairly large error range. 
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The fact that the galaxy numbers and the cluster mass are distributed in an identical manner 
rules out any significant velocity bias between the cluster galaxies and the dark matter. That 
is, a small (say 20%) decrease in the velocity dispersion of cluster galaxies relative to the dark 
matter would lead to a substantial segregation of the cluster galaxies relative to the cluster mass 
([West fc Richstone IQSSj Parlberg fc Dubinski 199ll, [Katz fc White 1993|, [Carlberg 19941). The 



conclusion to be drawn is that the galaxy tracers selected in O = 1 n-body simulations do not 
form like the galaxies in the real universe. The implications of this are unclear. It is possible to 
identify relatively unbiased galaxy tracers in n-body simulations ( Carlberg 1994| ) so the problem 



may be within the simulations. A more interesting possibility is that the understanding of galaxy 
formation and mass clustering that is incorporated into the simulations has some substantial 
deficiencies. 

There are several improvements in this analysis that could be made with more data, mainly 
at large radii. More data at large radii would reduce the variance resulting from substructure 
and sample all the clusters to similar r2007 diminishing the complications of the {M^/L) bias 
correction. With data extending beyond 5r2oo one enters the infall regime and expects to see 
the "compression effect" in the redshift space density contours of the ensemble cluster ( [Kaiser 



1987 , Regos fc: Geller 1989| ), which would empirically demonstrate that infall is occurring, and 



would give a measurement of ^^'^ /h in the nearly linear regime. With many more velocities in the 
cluster bodies the shape of the velocity ellipsoid can be directly measured, rather than taken from 
n-body simulations as was done here. 
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Table 1: Dynamical Parameters of the CNOC Clusters 
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* Very dependent on the angular size of the region sampled 

t Strong binary with erroneous velocity dispersion not included in average 

X Strong binary not included in average 
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Table 2: Ensemble Surface Density 
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Table 3: Line-of-Sight Velocity Dispersion (51 point) 
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Table 4: Error Budget 
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Fig. 1. — (NOT INCLUDED: see website) The ensemble cluster and field constructed from the 
normahzed sample, deleting MS0906+11 and MS1358+62 which are "binaries". A total of about 
1500 galaxies are plotted. 

Fig. 2. — The background density as a function of normalized velocity from the cluster center. The 
horizontal line shows the calculated mean value of the background over the interval 5 to 25c7i . The 
dashed histogram is the lower histogram times 10. The background for the full sample isin the left 
panel and for the sample that excludes the low redshift clusters is in the right panel. 

Fig. 3. — The surface number density profile of the ensemble cluster. The squares give the surface 
densities centered about the peak density, the circles centered about the BCG center. The line is 
the projection of the Hernquist function fit about the BCGs. 

Fig. 4. A fit to <Tp(i?) convolving a radial velocity dispersion of the function (T^(r) = (1 + r)~^ 
with the fit to the surface density for (3 of 0, 0.25, 0.5, and 0.75, as lines of increasing curvature. 
The error flags have been symmetrized and all are independent. The jagged line is the 51 point 
smoothed velocity dispersion. 

Fig. 5. — The 101 point moving average calculation of the projected velocity dispersion. The model 
lines are for a light-traces-mass model, with /3 = V2 (solid) and /3 = 0, 1 (at large radii, the upper 
and lower dashed lines, respectively). The line at the bottom is the absolute value of the local 
mean velocity. 

Fig. 6. — The My/L calculated from data of limited sky coverage on a cluster. The lower line is 
for background subtracted data. 

Fig. 7. — The ratio of the derived M(r) to L{r) for f3 of 0, 0.25, 0.5, and 0.75 (strongly circular 
to completely radial), from top to bottom at small radius, respectively for = —18.5 mag. The 
statistical errors are about 15%. 

Fig. 8. — The virial mass bias, bMv, at r2oo for varying /3. The symbols denote absolute magnitude 
cutoffs of -18.5 (triangles), -19.0 (squares) and -19.5 (circles). The offset points with the dashed 
error flags are about the peak of the number density. 

Fig. 9. — The gradient of the virial mass bias, bMv as a function of /?. The symbols denote absolute 
magnitude cutoffs of -18.5 (triangles), -19.0 (squares) and -19.5 (circles) which indicates the small 
changes for slightly different fits to the observed distributions. The points offset in /3 with dashed 
error flags are fitted about the peak of the number density. 

Fig. 10. — (NOT INCLUDED: see website) The redshift corrected color vs dimensionless redshift- 
radius. The BCGs are plotted at a uniform radius of 0.09 units. The average in bins of 0.2 radius 
unit is plotted as the solid line. 

Fig. 11. — The k-corrected luminosity vs redshift-radius. The average above the luminosity limit 



plotted as a solid line. 
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Fig. 6.— 
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Fig. 7.— 
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